A Better Approach for Horizontal Aggregations in SQL Using Data Sets for Data Mining Analysis
نویسندگان
چکیده
To analyzing the data efficiently in Data mining systems are widely using datasets with columns in horizontal tabular layout. Generally preparing a data set is the more complex task in a data mining project, require many complex SQL queries, aggregating columns and joining tables. Conventional RDBMS usually manage tables with vertical form. Aggregated columns in a horizontal tabular layout returns set of numbers, instead of one number per row. This new class of function is called horizontal aggregations. The system uses one parent table and different child tables, operations are then performed on the data loaded from multiple tables. We proposed three fundamental methods .They are SPJ (select-project-join-Aggregation), CASE, and PIVOT. SPJ based on standard relational algebra operators. CASE is useful to exploiting the programming case construct. PIVOT is a built-in operator in a commercial DBMS, PIVOT operator, offered by RDBMS is used to calculate aggregate operations. PIVOT methods are much faster methods and offer much scalability. Partitioning large set of data, obtained from the result of horizontal aggregation. Key Terms: Aggregation; Data Mining; Data preparation; Structured Query Language (SQL); Pivot Full Text: http://www.ijcsmc.com/docs/papers/August2013/V2I8201353.pdf
منابع مشابه
Prepare and Optimize Data Sets for Data Mining Analysis
Getting ready a data set for examination is usually the tedious errand in a data mining task, needing numerous complex SQL queries, joining tables and conglomerating sections. Existing SQL aggregations have limitations to get ready data sets since they give back one section for every amassed bunch. As a rule, a significant manual exertion is obliged to construct data sets, where a horizontal la...
متن کاملHorizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Data mining is widely used domain for extracting trends or patterns from historical data. However, the databases used by enterprises can’t be directly used for data mining. It does mean that Data sets are to be prepared from real world database to make them suitable for particular data mining operations. However, preparing datasets for analyzing data is tedious task as it involves many aggregat...
متن کاملAnomaly Detection and SQL Prepare Data Sets for Data Mining
Anomaly detection has been an important research topic in data mining and machine learning. Many real-world applications such as intrusion or credit card fraud detection require an effective and efficient framework to identify deviated data instances. However, most anomaly detection methods are typically implemented in batch mode, and thus cannot be easily extended to large-scale problems witho...
متن کاملK Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizontal Aggregations
Data mining refers to the process of analyzing the data from different perspectives and summarizing it into useful information that is mostly used by the different users for analyzing the data as well as for preparing data sets. A data set is collection of data that is present in the tabular form. Preparing data set involves complex SQL queries, joining tables and aggregate functions. Tradition...
متن کاملPreparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL
A huge amount of time is needed for making the dataset for the data mining analysis because data mining practitioners required to write complex SQL queries and many tables are to be joined to get the aggregated result. The traditional SQL aggregations prepare the data sets in vertical layout that is; they return result on one column per aggregated group. But for the data mining project, the dat...
متن کامل